Perfect Allocation Methods for Spatial Queries in Parallel Disk Systems
نویسندگان
چکیده
A disk-allocation method assigns a disk-id to each unit of spatial data. Allocating spatial data over multiple disks to distribute the I/O cost of query processing uniformly over available disks can tremendously speed up the processing. An allocation method is called perfect for a query set if it balances the I/O load on each disk in processing any query in a query set. Some of the interesting query sets in spatial databases include path queries and orthogonal range queries. We provide a perfect allocation method for all path and orthogonal range queries for five and fewer disks. A non-intuitive result in this paper shows that there is no perfect allocation method for the set of orthogonal range queries for more than five disks. We examine the effectiveness of Latin Square and Linear allocation methods for the interesting query sets in this context. We show that Latin Square allocation methods can be perfect for a class of orthogonal range queries and linear allocation method can be perfect on a class of path queries as well as many classes of orthogonal range queries.
منابع مشابه
Evaluation of Disk Allocation Methods for Parallelizing Spatial Queries on Grid Files‡
Spatial Database Systems are characterized by large amounts of geometric and geographic data. Query response times in these systems are crucial, since these systems are often used interactively for decision support systems. The Grid file[1] is a well-known spatial access method that has great potential for parallelism, which reduces the response time of spatial queries for time-critical on-line...
متن کاملDisk Allocation Methods for Parallelizing Grid Files
The grid file [1] is a well known access method for multi-dimensional and spatial data. The response time needed to process path and range queries on the grid file access method can be improved significantly by distributing the data pages over multiple disks. This paper explores the disk allocation methods used to allocate the data pages of grid file among a set of disks, which can be accessed ...
متن کاملStatic Task Allocation in Distributed Systems Using Parallel Genetic Algorithm
Over the past two decades, PC speeds have increased from a few instructions per second to several million instructions per second. The tremendous speed of today's networks as well as the increasing need for high-performance systems has made researchers interested in parallel and distributed computing. The rapid growth of distributed systems has led to a variety of problems. Task allocation is a...
متن کاملDisk Scheduling for Intermediate Results of Large Join Queries in Shared-Disk Parallel Database Systems
In shared-disk database systems, disk access has to be scheduled properly to avoid unnecessary contention between processors. The first part of this report studies the allocation of intermediate results of join queries (buckets) on disk and derives heuristics to determine the number of processing nodes and disks to employ. Using an analytical model, we show that declustering should be applied e...
متن کاملOn Flexible Allocation of Index and Temporary Data in Parallel Database Systems
Data placement is a key factor for high performance database systems. This is particularly true for parallel database systems where data allocation must support both I/O parallelism and processing parallelism within complex queries and between independent queries and transactions. Determining an effective data placement is a complex administration problem depending on many parameters including ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007